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Sortilin is a type I membrane glycoprotein belonging to the 
vacuolar protein sorting 10 protein (VpslOp) family of sorting 
receptors and is most abundantly expressed in the central 
nervous system. Sortilin has emerged as a key player in the 
regulation of neuronal viability and has been implicated as a 
possible therapeutic target in a range of disorders. Here, the 
identification of AF40431, the first reported small-molecule 
ligand of sortilin, is reported. Crystals of the sortilin- AF40431 
complex were obtained by co-crystallization and the structure 
of the complex was solved to 2.7 A resolution. AF40431 is 
bound in the neurotensin-binding site of sortilin, with the 
leucine moiety of AF40431 mimicking the binding mode of 
the C-terminal leucine of neurotensin and the 4-methyl- 
umbelliferone moiety of AF40431 forming jr-stacking with a 
phenylalanine. 



1. Introduction 

Sortilin is a 100 kDa type I membrane glycoprotein belonging 
to the vacuolar protein sorting 10 protein (VpslOp) receptor 
family. The receptor is composed of a large extracellular 
domain (75 kDa), a single transmembrane helix and a short 
cytoplasmic tail (Petersen et ai, 1997). The extracellular 
VpslOp domain is composed of a /3-propeller and a cysteine- 
rich (10CC) domain (Quistgaard et ai, 2009). The ^-propeller 
is composed of ten repeats (blades), forming a wheel-like 
structure with a funnel-shaped tunnel with a diameter of 
25-40 A (Fig. 4). Each blade is composed of a four-stranded 
/6-sheet and is stabilized by an Asp-box repeat in the loop 
between strands 3 and 4 (Quistgaard & Thirup, 2009). Sortilin 
is a key player in the regulation of neuronal viability and 
function and is widely expressed in the central nervous system 
(CNS) during development and in adulthood (Petersen et ai, 
1997; Hermans-Borgmeyer et ai, 1999). The receptor is mainly 
located in intracellular membrane structures (endoplasmic 
reticulum, trans-Golgi network and endosomal organelles) 
involved in the sorting of ligands. It has been estimated that a 
tenth of the cellular sortilin pool is found at the cell surface 
(Mazella et ai, 1998), mediating endocytosis of ligands 
through clathrin-coated pits (Nielsen et ai, 2001). The 
13-amino-acid neuropeptide neurotensin (NTS) is a ligand of 
the G-coupled neurotensin receptors (NTSRs), resulting in 
dopamine turnover and muscle relaxation, but is also bound 
by sortilin in the extracellular environment (Munck Petersen 
et al. , 1999). NTS is bound to sortilin in the centre of the tunnel 
of the yfi-propeller and the binding site is composed of a 
/3-strand interaction between the C-terminus (amino acids 10- 
13) of neurotensin and strand 1 of blade 6 in sortilin. A strong 
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salt bridge is formed between the carboxylic acid of the 
C-terminus and Arg292 (Quistgaard et al, 2009). 

A notable function of sortilin is its role in signalling by 
neurotrophins, a family consisting of nerve growth factor 
(NGF), brain-derived neurotrophic factor (BDNF) and 
neurotrophins 3 and 4 (S-3 and NT-4) in humans. The hall- 
mark of neurotrophin signalling, neuronal survival, is mediated 
through binding of mature neurotrophin to tyrosine receptor 
kinases (Trks; Wu et al, 2002). The affinity and specificity of 
neurotrophin binding is modulated by the presence of the low- 
affinity neurotrophin receptor p75 (p75NTR) acting as a co- 
receptor (Esposito et al, 2001; Hempstead et al, 1991; Bibel 
et al, 1999). Neurotrophins are expressed with a pro-domain, 
which is cleaved off in the Golgi, resulting in secretion of the 
mature domain (Scott et al, 1983). However, the release of 
unprocessed proneurotrophins results in signalling of apop- 
tosis via p75NTR (Lee et al, 2001) and sortilin (Nykjaer et al, 
2004; Teng et al, 2005; Tauris et al, 2011). The mature part of 
the neurotrophin is bound by p75NTR, whereas the propep- 
tide is bound by sortilin, thereby forming a high-affinity (K d 
of ~160 pM) heterotrimeric complex (Nykjaer et al, 2004). 
Knockout studies have supported the roles of p75NTR (Frade 
& Barde, 1999) and sortilin (Jansen et al, 2007) in proneuro- 
trophin-induced apoptosis signalling. Proneurotrophins are 
released to the extracellular environment in animal models of 
neurodegenerative conditions, including mechanical damage 
to the nervous system, focal ischaemia, axotomy, stroke and 
epileptic seizures (Dechant & Barde, 2002), and proNGF is 
detected in vivo for several days following injury (Jansen et al, 
2007). Elevated levels of proNGF (Harrington et al, 2004), 
p75NTR (Beattie et al, 2002) and sortilin (Jansen et al, 2007) 
are observed in acute injury models. These upregulations have 
been suggested to keep the nervous system functional by the 
clearance of damaged neurons (Lu et al, 2005). Interestingly, 
elevated levels of proNGF have been identified in patients 
diagnosed with Alzheimer's disease (Fahnestock et al, 2001). 
The levels of proneurotrophin in the extracellular environ- 
ment are controlled by plasmin and matrix metalloproteases 
(Lee et al, 2001). Elevated levels of inhibitors of the extra- 
cellular proteases are found under conditions resembling 
neurodegenerative diseases (Bruno & Cuello, 2006). 

Outside the CNS, proNGF is overproduced and secreted 
from malignant breast cancer cells, thereby inducing autocrine 
stimulation via TrkA and sortilin (Demont et al, 2012), and 
similar autocrine stimulation has been observed in melanoma 
cells (Truzzi et al, 2008). Sortilin has further been demon- 
strated to interact with apolipoprotein B100 in the Golgi and 
to facilitate the export of apoBlOO-containing lipoproteins, 
thereby regulating the levels of low-density lipoprotein (LDL) 
cholesterol, a key contributor to atherosclerosis and ischaemic 
heart disease, in plasma (Kjolby et al, 2010). 

The interaction between sortilin and the majority of its 
ligands, including the proneurotrophins, is inhibited by NTS 
(Nykjaer et al, 2004; Munck Petersen et al, 1999). Hence, a 
small-molecule ligand mimicking the binding mode of NTS 
could potentially be applied as a treatment in several disorders 
in which sortilin has been implicated. Here, we describe the 



identification of the first small-molecule ligand of sortilin, 
AF40431, and the structure determination of the receptor- 
ligand complex. 



2. Materials and methods 

2.1. Synthesis and characterization of AF40431 and 1h 

Compounds la-lg were obtained directly from the H. 
Lundbeck compound collection. AF40431 and lh were 
prepared by reductive alkylation and deprotection of l- and 
D-leucine rert-butyl esters with the corresponding coumarin 
aldehyde (7-hydroxy-4-methyl-2-oxo-2//-chromene-8-carb- 
aldehyde). 

Sodium triacetoxyborohydride [NaBH(OAc) 3 ] (1.55 g, 
7.34 mmol) was added to a solution of L-leucine fert-butyl 
ester hydrochloride (0.50 g, 2.44 mmol) and the coumarin 
aldehyde (0.68 g, 2.93 mmol) in dichloroethene (DCE; 50 ml) 
and acetic acid (0.14 g, 2.44 mmol) and the reaction mixture 
was stirred at room temperature for 16 h. Aqueous sodium 
bicarbonate (NaHC0 3 ) solution (15 ml) was added and the 
mixture was extracted with ethyl acetate (2 x 50 ml), washed 
with brine and dried over anhydrous sodium sulfate (Na 2 S0 4 ). 
Solvents were removed in vacuo and the crude compound was 
purified by silica gel (100-200 mesh) column chromatography, 
eluted with 16% of ethyl acetate in petroleum ether, to afford 
the ferf-butyl ester of AF40431 (0.37 g, 40%) as a pale yellow 
solid. : H NMR (CDC1 3 , 600 MHz, TMS) 8: 7.43-7.41 (1H, d, 
/ = 8.7 Hz), 6.80-6.78 (1H, d, / = 8.7 Hz), 6.07 (1H, s), 4.30- 
4.26 (1H, d, / = 14.9 Hz), 4.18-4.15 (1H, d, / = 14.9 Hz), 3.25- 
3.23 (1H, t, / = 7.1 Hz), 2.38 (3H, br s), 1.78-1.71 (1H, sp, 
/ = 6.1 Hz), 1.56-1.49 (2H, m), 1.51 (9H, s), 0.95-0.93 (3H, d, 
/ = 6.6 Hz), 0.92-0.90 (3H, d, / = 6.6 Hz). 

The fert-butyl ester of AF40431 (0.150 g, 0.39 mmol) was 
taken up in 1:1 dichloromethane (DCM)/trifluoroacetic acid 
(TFA) (2 ml) and stirred at room temperature for 3 h. The 
solvents were removed in vacuo and the crude compound was 
washed with diethyl ether (2x5 ml) and methanol (2x5 ml) 
and dried in vacuo to afford AF40431 (0.091 g, 72%) as a 
white solid. X H NMR (DMSO-d 6 , 600 MHz, TMS) 8: 9.95-8.95 
(1H, br s), 7.58-7.56 (1H, d, J = 8.8 Hz), 6.82-6.80 (1H, d, 
/ = 8.8 Hz), 6.15 (1H, s), 4.14-4.10 (1H, d, /= 14.4 Hz), 4.04- 
4.00 (1H, d, /= 14.4 Hz), 3.27-3.22 (1H, m), 2.37 (3H, s), US- 
UI (1H, sp, /= 6.5 Hz), 1.53-1.43 (2H, m), 0.89-0.87 (3H, d, 
/ = 6.5 Hz), 0.86-0.84 (3H, d, /= 6.5 Hz). Mass spectrum: tR = 
0.50 min, mlz = 320.0 (M + H) + . 

Compound lh was prepared in an analogous manner to 
AF40431, starting from D-leucine ferf-butyl ester hydro- 
chloride. *H NMR spectra were recorded at 600 MHz using a 
Bruker Avance AV-III-600 instrument. Chemical shift values 
are expressed in p.p.m. relative to tetramethylsilane. The 
following abbreviations or their combinations are used for the 
multiplicity of NMR signals: br, broad; d, doublet; m, multi- 
plet; obs, obscured; s, singlet, sp, septet; t, triplet. LC-MS was 
run on a Waters Aquity UPLC-MS consisting of a column 
manager, a binary solvent manager, a sample organizer, a 
PDA detector (operating at 254 nm), an ELS detector and an 
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SQ-MS equipped with an APPI source operating in positive- 
ion mode. The column was an Acquity UPLC BEH C18 
(1.7 urn, 2.1 x 50 mm) operating at 60°C with a 1.2 ml min -1 
binary gradient consisting of water/0.1% formic acid (solvent 
A) and acetonitrile/5% water/0.1% formic acid (solvent B). 
The gradient changes between 10% solvent B and 99.9% 
solvent B over the 1 min run. 

2.2. Chiral supercritical fluid chromatography 

Chiral supercritical fluid chromatography (SFC) was run 
on an Aurora Fusion A5 unit together with Agilent 1100/1260 
modules using a Phenomenex Lux 3u Cellulose-2 column, 4.6 
x 250 mm, flow 3 ml min -1 under 15 MPa pressure at 40°C. 
Ethanol and 0.1% diethylamine were used as modifiers at 20% 
concentration and UV detection operated at 254 and 230 nm. 
In order to demonstrate that the compounds were enantio- 
merically pure, AF40431 and lh were separately treated with 
TMS-CH 2 N 2 in THF to afford the corresponding methyl esters 
with identical NMR spectra: : H NMR (CDC1 3 , 600 MHz, 
TMS) <5: 7.43-7.41 (1H, d, / = 8.5 Hz), 6.81-6.79 (1H, d, / = 
8.5 Hz), 6.08 (1H, d,/= 1.1 Hz), 4.37-4.34 (1H, d,/= 14.1 Hz), 
4.11-4.07 (1H, d,/= 14.1 Hz), 3.87 (3H, s), 3.42-3.38 (1H, m), 
2.39-2.38 (3H, d, / = 1.2 Hz), 1.79-1.71 (1H, sp, / = 6.8 Hz), 
1.58-1.55 (2H, obs m), 0.95-0.94 (3H, d,/= 6.8 Hz), 0.93-0.92 
(3H, d, / = 6.8 Hz). Mass spectrum: tR = 0.45 min, mlz = 334.1 
(M + H) + . These methyl esters were submitted to chiral SFC 
both in their pure forms and as a 1:1 mixture of enantiomers. 
The pure sample of the methyl ester derivatives of AF40431 
and lh displayed unique retention times (tR = 2.85 and 
3.34 min, respectively) by chiral SFC, while two distinct signals 
were observed for the mixed sample (tR = 2.84 and 3.33 min). 
As the homochiral starting materials l- and D-leucine afforded 
the homochiral products AF40431 and lh, respectively, it can 
be inferred that AF40431 has the (S) absolute configuration 
of L-leucine; as to not do so would have required complete 
stereochemical inversion during one of the chemical trans- 
formations, which would be both highly unlikely and unpre- 
cedented for these transformations. The absolute configuration 
of AF40431 was not unambiguously characterized. 

2.3. Sortilin expression and purification 

The open reading frame of human sortilin including the 
endogenous signal peptide to residue 723 was amplified by 
PCR and cloned into the pCEP-pu vector with Xbal and 
Xhol. The primers were designed to include a C-terminal 
factor Xa site followed by a hexahistidine tag. HEK293F cells 
were transfected with 900 ug plasmid, applying 900 ul Free- 
Style MAX reagent (Invitrogen) according to the manufac- 
turer's instructions, and were grown in 900 ml FreeStyle 293 
medium (Invitrogen) in a 3 1 Erlenmeyer flask for 7 d. The 
medium was harvested by centrifugation at 3700g for 15 min, 
filtered through a 0.40 um filter and the pH was adjusted by 
supplementation with Tris-HCl pH 8.0 to a final concentration 
of 50 mM. A 5 ml Ni-NTA column (GE Healthcare) was 
equilibrated in buffer A (50 mM Tris-HCl pH 8.0, 150 mM 
NaCl) and the medium was recirculated over the column 



overnight. The column was washed with five column volumes 
of wash buffer (50 mM Tris-HCl pH 8.0, 150 mM NaCl, 
10 mM imidazole) and sortilin was eluted with a ten-column- 
volume imidazole gradient (10-250 mM imidazole). Fractions 
containing sortilin were pooled and concentrated by ultra- 
centrifugation (Vivaspin 20, 10 kDa cutoff, Sartorius). The 
sortilin was applied onto a Superdex 200 10/300 GL column 
(GE Healthcare) equilibrated in buffer A. Fractions 
containing sortilin were pooled, concentrated to a final 
concentration of 5 mg ml -1 , flash-frozen in liquid nitrogen and 
stored at -80° C. 



2.4. Scintillation proximity assay 

The compound affinity was determined by measuring the 
displacement of 3 H-neurotensin binding to sortilin using a 
scintillation proximity assay (SPA). Experiments were 
performed in assay buffer (50 mM HEPES pH 7.4, 100 mM 
NaCl, 2.0 mM CaCl 2 , 0.1% BSA, 0.1% Tween-20) with a total 
volume of 40 ul. The compound was pre-incubated for 30 min 
at room temperature with 150 nM sortilin before 5 nM 
3 H-neurotensin and Ni-chelate imaging beads (Perkin Elmer) 
were added. After 6 h, the plate was read on a ViewLux with 
360 s exposure time. Unlabelled neurotensin and a DMSO 
blank were used as positive and negative controls, respectively. 
Dose-response evaluation of compounds was performed with 
ten concentrations of la-lh and AF40431 (covering concen- 
trations between 2.5 nM and 50 uM). The half-maximal 
inhibitory concentration (IC 50 ) values were calculated by non- 
linear regression using the sigmoid concentration response 
(variable slope) in XLfit 4 (IDBS, UK). All values reported 
are the averages of at least four determinations. 



2.5. Isothermal titration calorimetry 

The binding of AF40431 to sortilin was measured by 
isothermal titration calorimetry (ITC). The titration experi- 
ments were performed on a MicroCal iTC200 isothermal 
titration calorimeter (MicroCal, Northampton, Massachusetts, 
USA). Sortilin was dialysed against PBS buffer pH 7.4. 
AF40431 was dissolved in PBS buffer pH 7.4 supplemented 
with 4%(v/v) dimethyl sulfoxide (DMSO). All solutions were 
filtered and degassed to avoid bubble formation and were 
equilibrated to the corresponding temperature before each 
experiment. Sortilin (30 uM) supplemented with 4%(v/v) 
DMSO was titrated at 25°C with AF40431 (500 uM) in 20 
steps of 2 ul (first step, 0.4 ul). The time between injections 
was set to 150 s and the syringe mixing speed was set to 
1000 rev min -1 . Heat evolving from dilution was measured by 
injecting the ligand into PBS supplemented with 4% DMSO. 
This heat was subtracted from the heat of reaction to obtain 
the effective heat of binding. Finally, the binding stoichiometry 
(n), equilibrium dissociation constant (K A ), molar enthalpy 
and entropy changes for the binding processes were deter- 
mined by analyzing the titration data using Origin software 
(OriginLab, Northampton, Massachusetts, USA). 
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2.6. AF40431 fluorescence spectroscopy 

AF40431 and sortilin were diluted in PBS buffer pH 7.4 to a 
final concentration of 0.1 uM. Fluorescence was recorded on a 
RF-5301PC spectrofluorophotometer (Shimadzu) maintained 
at 23 ± 1°C using 3 and 5 nm excitation and emission slit 
widths, respectively. The excitation and emission wavelengths 
were determined by scanning the ranges 250-430 and 370- 
600 nm, respectively. 

2.7. Sortilin crystallization, data collection and refinement 

Initial screening was performed using in-house poly- 
ethylene glycol (PEG) screens designed with the Mimer 
spreadsheet (Brodersen et al, 2013). Sortilin was pre- 
incubated with a ten-molar excess of AF40431 for 1 h. 200 nl 
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sortilin- AF40431 was mixed with 200 nl reservoir solution 
using a Mosquito liquid-handling robot (TTP LabTech) and 
equilibrated against 100 ul reservoir solution using the sitting- 
drop vapour-diffusion method at 292 K. Several crystal hits 
were obtained and were further optimized by grid screening 
and varying the molar excess of AF40431 in 24- well crystal- 
lization plates. The best diffracting crystals were obtained with 
a 20-molar excess of AF40431 in 0.1 M HEPES-Tris pH 7.3, 
0.4 M sodium malonate, 26% (Wv) PEG 3350 (Hampton 
Research), 8%(v/v) glycerol. The crystals grew to dimensions 
of 200 x 100 x 50 pm over two weeks. The crystals were 
mounted in LithoLoops (Molecular Dimensions) from the 
mother liquor, excess mother liquor was removed by gently 
touching the side of the well with the edge of the loop (Morth 
et al. , 2007) and the crystals were flash-cooled in liquid N 2 . A 
complete single-wavelength (X = 0.9 A) data set of 1100 
oscillation images with 0.2° oscillation was collected at 100 K 
on the X06DA (PXIII) beamline at the Swiss Light Source 
(SLS) using a PILATUS 2M-F detector (Dectris). The 
diffraction images were processed in XDS (Kabsch, 2010) and 
the reflections were scaled in SCALA (Winn et al, 2011). 
Molecular replacement was performed with Phaser (McCoy et 
al, 2007) using a search model derived from the structure of 
sortilin in complex with neurotensin (PDB entry 3f6k; Quist- 
gaard et al, 2009). Rigid-body refinement, generation of 
ligand coordinates and restraints, calculation of OMIT maps 
and refinement were performed in P HEN IX (Adams et al, 
2010). Model building and analysis was performed using Coot 
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Figure 1 

Identification of AF40431. (a) Chemical structure and half-maximal inhibitory concentration (IC 50 ) values determined in the SPA assays of the hit 
compound la and its analogues (lb-lg). The stereochemistry of the samples in the compound collection was undefined [i.e. (+), (— ) or (±)]. (b) 
Chemical structure of Calcein Blue (left) and tridentate metal (M) chelation by Calcein Blue (right), (c) Chemical structure and IC 50 values determined 
from the SPA assay of AF40431 and its stereoisomer lh. Chemical structure were drawn using ChemSketch (ACD/Labs; http://www.acdlabs.com). 
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Table 1 

Data-collection and refinement statistics for the sortilin-AF40431 
complex. 

Values in parentheses are for the highest resolution shell. 
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(Emsley et al, 2010). The final model quality was analysed 
using MolProbity (score of 11.77; Chen et al, 2010). R.m.s.d. 
values were calculated, superpositions were performed and 
structural figures were prepared using PyMOL (Schrodinger; 
http://www.pymol.org). 



3. Results 

3.1. Identification of AF40431 

Compound la, a promising hit with an IC 50 of 1.8 uM 
(Fig. la), was identified in a high-throughput screen of the 
H. Lundbeck compound collection using the 3 H-neurotensin- 
binding sortilin scintillation proximity assay (SPA), la is 
closely related to the dye Calcein Blue (Fig. lb) and could thus 
potentially cause assay interference by fluorescence. Further- 
more, molecules of the general type 1 have been reported 
(Wilkins, 1960) to ligate metals via a tridentate binding 
interaction (compound 2; Fig. 15), which could also potentially 
cause assay interference. However, screening of analogues of 
la revealed a promising structure-activity relationship, which 
was consistent with specific molecular recognition rather than 



chemical-mediated assay interference (Fig. la). Specifically, 
the activity was retained when the leucine-type side chain of 
la (CH 2 'Pr) was exchanged to the isoleucine-type side chain 
of lb [CH(Me)Et], the neopentyl substituent of lc or the 
cyclohexyl substituent of Id. In contrast, the activity was 
ablated by either substitution with the more compact valine - 
and alanine -type side chains of le ('Pr) and If (Me), 
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Figure 2 

Thermodynamics of the binding of AF40431 to sortilin. (a) The raw 
titration data and the integrated titration curves are depicted in the upper 
and lower panels, respectively. The final values of the determined 
thermodynamic parameters are listed in the lower panel, (b) Binding 
signature plot listing the enthalpic (AH) and entropic (-TAS) 
contributions to the free binding energy (AG). 
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respectively, or the bulkier phenylalanine-type side chain of lg 
(Bn). The stereochemistry of the samples in the compound 
collection (la-lg) was undefined and therefore the influence 
of the stereochemistry on the activity was explored for la. The 
activity was found to reside in only one enantiomer, AF40431, 
whereas the corresponding antipode lh was inactive (Fig. lc). 
Such enantiospecific activity is consistent with a genuine 
molecular-recognition event with the target protein rather 
than assay interference, which would be expected to afford 
similar effects for both enantiomers. The activity of racemic 
sample la and the homochiral AF40431 were identical within 
experimental error. The binding of AF40431 to sortilin was 
furthermore confirmed by ITC with a dissociation constant 
(K d ) of 0.7 \i.M (Fig. 2a), and the free binding energy is driven 
by both enthalpic and entropic contributions (Fig. 2b). These 
structure-activity relationships gave us sufficient confidence in 
AF40431 to attempt co-crystallization with sortilin. 




(b) 

Figure 3 

Electron-density maps of AF40431. (a) Bias-reduced simulated-annealing 
F a — F c map contoured at the 3.0a level, (b) Final 2F a — F c map 
contoured at the 1.5a level. 
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3.2. Crystallization of the sortilin-AF40431 complex 

The extracellular VpslOp domain of sortilin was over- 
expressed in HEK293F cells and purified from the medium 
by immobilized metal-affinity chromatography. Sortilin was 
further purified by size-exclusion chromatography, eluting as a 
single symmetrical peak corresponding to monomeric sortilin. 
Crystals of the sortilin- AF40431 complex were obtained by 
co-crystallization. Initial crystallization hits were identified 
in an in-house polyethylene glycol (PEG) screen and were 
further optimized by grid screening and increasing the molar 
excess of AF40431. A complete data set was collected, scaling 
to a maximum resolution of 2.7 A. The crystals of the sortilin- 
AF40431 complex belonged to the monoclinic space group C2, 
with unit-cell parameters a = 160.9, b = 79.2, c = 111.7 A, 
/8 = 127.3°. The data statistics are summarized in Table 1. 

3.3. Structure determination of the sortilin-AF40431 
complex 

The atomic coordinates of the published sortilin-neurotensin 
complex structure (PDB entry 3f6k; Quistgaard et al, 2009), 
stripped of NTS, glycosylations and waters, was applied as a 
search model to obtain initial phases by molecular replace- 
ment (MR), yielding a translation-function Z-score (TFZ) of 
66.5. The asymmetric unit was composed of a single molecule. 
The molecular-replacement solution was subjected to rigid- 
body and simulated-annealing refinement. Previously identi- 
fied glycosylations (Quistgaard et al., 2009) were easily iden- 
tified in the resulting bias-reduced F Q — F c electron-density 
maps, thereby validating the MR solution. Furthermore, 
additional density was observed in the NTS-binding site in the 
bias-reduced F a — F c electron-density maps which remained 
during the building of glycosylations and subsequent rounds of 




Figure 4 

Overview of AF40431 binding to sortilin. The ten blades of the 
/S-propeller and the 10CC domain are represented as grey and green 
cartoons, respectively. Neurotensin was superimposed from PDB entry 
3f6k (Quistgaard et al, 2009) and is depicted as a purple ball-and-stick 
representation; AF40431 is depicted as orange sticks and spheres. 
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refinement. Coordinates and geometric restraints were 
generated for AF40431, and the ligand could unambiguously 
be modelled into the density observed at the NTS-binding site. 
The structure was refined, resulting in good R factors (i? work 
and R bee of 20.7% and 22.8%, respectively; Table 1). Binding 
of AF40431 was confirmed by bias-reduced simulated- 
annealing OMIT maps (Fig. 3a) and the density for AF40431 
was well defined in the final 2F a — F c maps (Fig. 3b). 

3.4. AF40431 binding to sortilin 

AF40431 is bound to sortilin in the centre of the tunnel of 
the /3-propeller at the NTS-binding site (Fig. 4). The overall 
structure of the AF40431-sortilin complex resembled that of 
the NTS-sortilin complex (r.m.s.d. of 0.55 A for C™ atoms), 
with the 10CC domain wrapped around the yf3-propeller. The 
leucine moiety of AF40431 is bound to sortilin in the same way 




P10 



as that of NTS, with the carboxylic acid forming a salt bridge 
to Arg292. The orientation of the carboxylic acid is further 
stabilized by hydrogen bonds to Ser283 and to the amide of 
Tyr318 (Fig. 5a). A hydrogen bond is formed between the 
secondary amine of AF40431 and the carbonyl of Tyr318 
(Fig. 56). The isopropyl group of AF40431 is buried in the 
hydrophobic pocket formed by Ser272, Phe281, Ile294 and 
Ile320, but is rotated 120° compared with that of NTS. The 
2-pyrone of the 4-methylumbelliferone moiety of AF40431 
forms an off-centre parallel ^-stacking interaction with 
Phe317 (Fig. 5b). 



4. Discussion 

A promising hit, compound la, was identified from the H. 
Lundbeck compound collection by screening the displacement 
of 3 H-NTS binding to sortilin in a scintillation proximity assay. 
We were initially suspicious of the activity of la and were 
concerned that the hit could be spurious owing to assay 
interference resulting from some potentially problematical 
structural features of the molecule. A structural alert against 
the pan assay interference compounds (PAINS) filters for 
assay interference is triggered by molecules of type 1 owing 
to their potential to undergo a retro-Mannich reaction and 
liberate reactive species (Baell & Holloway, 2010). However, 
a sound structure-activity relationship was identified for six 
analogues of la (lb-lg). Some variation in the length and size 
of the a-alkyl substituent of la was allowed (lb-Id), whereas 
smaller (le and If) and larger (lg) substituents ablated activity 
(Fig. la). The activity of la was shown to be enantiospecific, 
with all activity residing in a single enantiomer, namely 
AF40431, whereas the corresponding antipode lh was inactive 
(Fig. lc). The direct binding of AF40431 to sortilin was 
confirmed by ITC. The observed K d (0.70 \iM) is fivefold 



R292 



S283 




Y318 




AF40431 



3S27 

**F281 



(b) 



(c) 



Figure 5 

Binding mode of AF40431 to sortilin. (a, b) Sortilin is represented as a cartoon in grey and the residues interacting with AF40431 are represented as 
sticks. C atoms of sortilin, AF40431 and NTS are depicted in grey, orange and purple, respectively, and N and O atoms are depicted in blue and red, 
respectively, (c) Schematic diagram of hydrogen bonds (black punctured lines) and hydrophobic interactions (grey arches with spokes) between 
AF40431 and sortilin prepared in LigPlot* (Laskowski & Swindells, 2011). 
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higher than the K d of NTS (0.15 \iM), with the higher affinity 
of NTS originating from the hydrogen bond between Tyrll of 
NTS and Lys227 of sortilin (Quistgaard et al, 2009) and from 
contributions from the N-terminal region of NTS to binding 
(Quistgaard, 2014). 

Crystals of the AF40431-sortilin complex were obtained by 
co-crystallization. A complete data set scaling to 2.7 A reso- 
lution was collected and the structure was solved by molecular 
replacement. Despite belonging to the same space group as 
the sortilin-NTS complex structure, a different packing of the 




molecules (Supplementary Fig. SI), similar to that of the 
selenomethionine-derivative and heavy-metal-derivative 
crystals of the sortilin-NTS complex, was observed (Quist- 
gaard et al, 2009). AF40431 is bound to sortilin at the NTS- 
binding site and the leucine moiety of the ligand is bound to 
sortilin in the same way as the leucine of NTS (Fig. 4). 

Overall, the binding of AF40431 to sortilin is driven by four 
components: electrostatic interaction and hydrogen bonds 
between the carboxylic acid of AF40431 and Arg292, Ser283 
and Tyr318, a hydrogen bond between the secondary amine 
of AF40431 and the carbonyl of Tyr318, hydrophobic inter- 
actions between the isopropyl group and Ser272, Phe281, 
Ile294 and Ile320, and 7r-stacking between the 2-pyrone and 
Phe317 (Fig. 5c). The observed interactions are in good 
agreement with the free binding energy being driven by both 
enthalpic (electrostatic and hydrogen-bond interactions) and 
entropic (hydrophobic interaction) contributions (Fig. lb). 

The selective stereochemistry of AF40431 is defined by the 
hydrogen bond between the secondary amine of AF40431 
and the carbonyl of Tyr318, the electrostatic interaction with 
Arg292 and the hydrophobic interaction of the isopropyl 
group. The simultaneous interaction with these three compo- 
nents is excluded by the stereochemistry of lh (Fig. 5). 

The structure-activity relationships obtained in the NTS 
scintillation proximity assay can be explained well by the size 
of the leucine -binding pocket of sortilin. The width of the 
pocket is determined by Ser272, Ile294 and Ile320, whereas 
the depth is determined by Phe281, which forms the 'floor' of 
the pocket (Fig. 6a). The pocket allows substitutions of similar 
size, namely the isoleucine-type side chain of lb [CH(Me)Et] 
and the neopentyl and cyclohexyl substituents of lc and Id, 
respectively. The smaller isopropyl and methyl substituents of 
le and If, respectively, would not be expected to reach into the 
hydrophobic pocket and the bulkier benzyl substituent of lg 
(CH 2 Ph) would be expected to sterically clash with the 'floor' 
formed by Phe281 (Fig. 6b). 
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(a) The hydrophobic leucine-binding pocket. Sortilin is represented as a surface and is coloured according to atom type (C, N and O atoms in grey, blue 
and red, respectively). AF40431 is shown as a ball-and-stick representation in orange. The depth of the hydrophobic pocket is determined by Ser272, 
Phe281, Ile294 and Ile320. (b) Superposition of AF40431 and NTS in the structure of the NTSR1-NTS complex (PDB entry 4grv) represented as a 
cartoon in green (White et al., 2012). The 4-methylumbelliferone moiety of AF40431 clashes with Phel28 in TM2 and Tyr351 in TM7. (c) Combined 
excitation and emission fluorescence spectrum for AF40431 (full lines) and the sortilin- AF40431 complex (dotted lines). 
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The phenol group of the 4-methylumbelliferone moiety of 
AF40431 is located in the position corresponding to that of 
Ilel2 of NTS in the sortilin-NTS complex (Figs. 5a and 5b) 
and, similar to Ilel2, the phenol does not seem to contribute 
to the binding (Quistgaard et al, 2009). A parallel 7T-stacking 
interaction is formed between the 2-pyrone of the 4-methyl- 
umbelliferone moiety of AF40431 and sortilin. An off-centre 
stacking is caused by the two electron-rich systems of 2-pyrone 
and Phe281 (Hunter & Sanders, 1990). The hydroxyl, ketone 
and methyl groups of 4-methylumbelliferone are not involved 
in interactions with sortilin and hence the structure-activity 
relationship should be explored for this two-ring system. 

NTS binds to G-coupled neurotensin receptors (NTSRs) 
via amino acids 8-13, with the C-terminal leucine forming an 
electrostatic interaction with an arginine in the binding site 
(White et al, 2012). Hence, off-target reactions of AF40431 
with the NTSRs could be expected. However, the Ilel2 pocket 
of NTSR1 is narrow and steric clashes between the 4-methyl- 
umbelliferone moiety of AF40431 and Phel28 of transmem- 
brane helix 2 (TM2) and Tyr351 of TM7 in NTSR1 were 
identified when AF40431 was superimposed on NTS in the 
NTSR1-NTS complex structure (White et al, 2012; Fig. 6b). 
No competition of NTS binding to NTSR1 was observed for 
AF40431 (IC 50 > 50 \iM; data not shown). 

The Calcein Blue moiety has previously been applied as a 
tracer in cellular and tissue studies (O'Malley et al, 1999). The 
fluorescent property of Calcein Blue is retained in AF40431 
and the fluorescence is not statically quenched by the 
7T-stacking with Phe317 upon binding to sortilin (Fig. 6c). 
Hence, AF40431 could be applied as a molecular tracer of 
sortilin. The identification of AF40431 suggests that sortilin is 
ligandable by small molecules and thus provides a stimulus to 
further small-molecule ligand discovery efforts for sortilin. 
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